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Abstract 

The numerical properties of staggered Dirac operators with a taste-dependent mass term 
proposed by Adams [T][2] and by Hoelbling [3] are compared with those of ordinary staggered 
and Wilson Dirac operators. In the free limit and on (quenched) interacting configurations, we 
consider their topological properties, their spectrum, and the resulting pion mass. Although 
we also consider the spectral structure, topological properties, locality, and computational 
cost of an overlap operator with a staggered kernel, we call attention to the possibility of 
using the Adams and Hoelbling operators without the overlap construction. In particular, the 
Hoelbling operator could be used to simulate two degenerate flavors without additive mass 
renormalization, and thus without fine-tuning in the chiral limit. 



1 Introduction 



The spontaneous breakdown of chiral symmetry plays a central role in the spectrum of light 
hadrons. Since it is an intrinsically non-perturbative phenomenon, the only way to study it from 
the first principles of QCD is via the lattice regularization. Yet, already many years ago Nielsen 
and Ninomiya proved that a translationally invariant, local lattice formulation of the QCD Dirac 
operator D, retaining chiral symmetry in the massless limit, and with the correct number of 
physical fermionic degrees of freedom, is forbidden [4]. This no-go theorem can be circumvented, 
by constructing lattice fermions satisfying a modified form of chiral symmetry [H] , and obeying the 
Ginsparg- Wilson relation [6] . Although explicit formulations of lattice Ginsparg- Wilson fermions 
are known [7], currently their practical use in realistic, large-scale lattice QCD simulations is still 
limited, due to the high computational overhead. 

The most widely-used lattice discretizations of the Dirac operator are either based on the addition 
of a second-derivative term to the kinetic part of the quark action [8] to remove (or "quench" ) the 
unphysical "doubler" modes in the continuum limit by giving them a mass 0(a~ 1 ), or on a site- 
dependent spin diagonalization, which leads to the so-called staggered formulation [9j . The former 
approach introduces an explicit breaking of chiral symmetry, and, as a consequence, an additive 
renormalization of the quark mass, which has to be fine-tuned. In contrast, the staggered operator 
preserves a remnant of chiral symmetry (sufficient to forbid additive mass renormalization), and 
leads to a reduction of the matrix size. However, the staggered formulation only removes part 
of the unphysical modes, reducing the number of quark species in four (d) spacetime dimensions 



from 16 (2 d ) down to four (2 d / 2 ) "tastes", which become degenerate 10 (and consistent with the 
properties related to the global symmetries of the continuum Dirac operator [IT]) in the a — > 
limit. In order to simulate QCD with two light fermions, one then has to apply the so-called 



"rooting trick" , which has been a subject of debate for the last few years 12 



Some recent works have discussed the idea of using a staggered kernel with a taste- dependent 
mass term to obtain two (or one |3[|13|) massless fermion species. Such formulation, which is one of 



the various approaches aiming at minimally doubled fermions 14 , could combine the advantages of 
the overlap construction with the computational efficiency of a staggered kernel. Furthermore, this 
formulation appears to be particularly attractive from the point of view of topological properties [l] . 

Using a staggered operator with a "flavored" mass term as the kernel in an overlap construction 
is a very appealing idea, but the properties of such operators (with various taste-dependent mass 
terms) are interesting on their own. In fact, while the overlap construction completely removes 
the need for fine tuning to achieve massless fermions, it still leads to a considerable computational 
overhead. In contrast, using a staggered operator with taste-dependent mass a la Wilson requires 
fine tuning to obtain exactly massless modes, but, by virtue of the reduced size of the operator, 
may still be a computationally competitive alternative to the usual Wilson discretization, while 
avoiding the rooting prescription. 

This motivation led us to address a numerical investigation of different operators of this type 
that we present here (preliminary results have appeared in |13| ). In the following, we present 
a systematic classification of the possible taste-dependent mass terms, discuss their analytical 
features in the free theory, and then move on to the interacting case that we study via numerical 
simulations. We perform an elementary measurement of the pion mass on a set of quenched 
configurations, and verify the expected PCAC behaviour as one approaches the chiral limit. In 
an Appendix, we also explore the properties of the staggered overlap operator proposed in [l], 
in comparison with the usual overlap based on the Wilson kernel. In particular, we compare the 
locality of the operators, and the computational cost of applying them to a vector and of solving 
for the quark propagator. 

The structure of this paper is as follows. First, in sec. [2] we recall theoretical aspects of the 
construction of taste-dependent mass terms, and discuss their spectral structure in the free field 
case. Then, we address the interacting case, presenting our numerical studies in sec. [3] We 
summarize our findings and discuss their implications for possible future, large-scale applications 
of these operators in sec. [4] Finally, in the appendix |A"} w e report on our study of an overlap 
operator based on a staggered kernel, as proposed in ref. 111. 
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2 Theoretical formulation and general features 



d 



The staggered operator [9j 

Acs = ^X>(^-^) (l) 

with Tj^ix) — (— l)^"<c x " and (V^) a = U^(x)5 x + a f l . y , is a computationally very efficient way 
to discretize the massless QCD Dirac operator on a d-dimensional Euclidean hypercubic lattice 
of spacing a. This operator is invariant under a global U(l) symmetry, which can be interpreted 
as a remnant of chiral symmetry: in fact, Z?ks anticommutes with the operator 1^55 defined by 
Q^55)x y = (~1)^" =1 x " ■ in the free theory, one can easily see that in four dimensions the operator 



r 55 has 75 ® 75 structure in spin-taste space 15 . The construction of Z?ks is based on a local 



spin diagonalization, which, for the four-dimensional case, allows one to reduce the number of 
fermion components by a factor of 4 with respect to the naive operator, and yields four tastes 
in the continuum limit. The degeneracy between these four tastes is explicitly broken by gauge 
interactions at finite lattice spacing a, but is recovered in the continuum limit a — > 0. 

Recently, various works explored the idea of using staggered operators with taste-dependent mass 



terms Illl3 13 . Following, e.g., the discussion in the classic paper by Golterman and Smit 16 , the 



possible matrix structures (in taste space) for a mass term can be classified as 

• 1 ("0-link"), of the form 5 x>y 

• 7a ("1-link"), involving a sum of terms, each containing 1 link U p 

• &ui3 ("2-link"), involving a sum of terms, each containing 2 links UfJJ v 

• 757a ("3-link"), involving a sum of terms, each containing 3 links U^U V U P 

• 75 ("4-link"), involving a sum of terms, each containing 4 links U p U v U p U a 

It is highly desirable to preserve the symmetry = D\ because it guarantees that 

det-D is real, and non- negative (in the absence of real negative eigenvalues), thus avoiding a "sign 



problem" in the measure 17 . This symmetry is satisfied only if the hcrmitian mass term connects 
sites of the same parity. Thus, we do not consider the 1-link or a 3-link mass terms further. 

This leaves three possibilities: 0-, 2- and 4-link mass terms)]] The 0-link mass term corresponds 
to the usual staggered operator, with a taste-independent bare mass 

D = D KS + m. (2) 



The staggered operator with a 2-link mass term, which was discussed in refs. 3fl3 , can be written 
in the form 

D 2 = D KS + -?- (M u + M13 + M14 + M 23 + M 24 + M 34 ) , (3) 
v3 

where (following the notation of ref. (3j) 

M llv = ir\ liV C tlv , (4) 

(V^)x,y = (Vun)x, V = ( -1 ) X<< W> for M < v , ( 5 ) 
i=p,+ l 

Cfiu — — [C^C V + C v Cp) , (6) 

C, = \(V, + Vl). (7) 

x It is also possible to consider the above matrix possibilities with an extra factor |16| . In that case, 75- 

hermitian mass terms are obtained in the 0-, 1- and 3-link cases. However, we did not find a continuum-like 
dispersion relation for the real modes in any of these cases. 
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Figure 1: Left panel: Spectrum of -Dks m the free limit. Central panel: Free spectrum of operator 
Di (eq.([3|), which includes a taste-dependent mass term with tensor-like structure in taste space 
(i.e., a 2-link mass term). Right panel: Free spectrum of operator (eq.Q), which includes a 
taste-dependent mass term with 75 structure in taste space (i.e., a 4-link mass term). 



Finally, a staggered operator featuring a mass term with 75 structure in taste space [T] can be 
written as 

At = -Dks - -r 55 r 5 , (8) 



with 
where 



T5 = V5C, 



11=1 



(9) 
(10) 



while C is the average of four-link parallel transporters joining sites at opposite corners of the 
elementary lattice hypercubes 

C^^C^C^. (11) 

perm 

Note that the mass term appearing on the r.h.s. of eq. {[sj) is Hermitean and commutes with T^. 

To understand the properties of these three different types of operators it is instructive to start 
by discussing their spectra in the free limit. The three panels in Fig. [TJshow the structure of the 
spectrum of eigenvalues for Dks (for Dq, the spectrum is just trivially shifted by m), for D2, and 
for D4 in the non-interacting case. 

In the free limit the eigenvalues of Dq on a lattice with sites along the /i direction read 



A = m ± i A 



, ^sin 2 p M , with: 
\ »=i 



2tt 



+£/»). fc M e {0,1,2,..., LJ2}, 



(12) 



with eight degenerate eigenvalues of both signs, and where e M = (1/2) if the fermionic field 
satisfies (anti-)periodic boundary conditions along the fj, direction. 
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For £>2 the free eigenvalues take the form (for p — 1) 



\ l =±\jA 1 - v i±2i^A^, (13) 

and 



A 2 =± V M 2 - : p 2 ±2^v / I^ (14) 

in which the ± signs are chosen independently and the eigenvalues are doubly degenerate, and 
having defined 

4 

p 2 = ^sin 2 p M , (15) 

A x = ^ + C ' C ' + C ' C ' + ^ + C ' C ' + C ' C ' - 2c = + 0(a 2 ), (16) 
3 

M = c 2 c 2 + c 2 c 2 + c 2 c 2 + c 2 c 2 + c 2 c 2 + c|cl + 2c = 4 + 0(q4)) (17) 
3 

where c^ = cosp^, and c = C1C2C3C4. Expanding for small momenta gives 



Ax = iV-p 2 = ±ip, A 2 = ±2 v /l±ip = ±2±ip, (18) 

so that at low momenta, the eigenmodes corresponding to A 2 get a mass of ±2, while the eigenmodes 
corresponding to Ai are massless. 

Finally, the free spectrum of D4 reads: 

Ai = -c^ ±iy/fP, A 2 = +c^ ± iy^, (19) 
a a 

Note that, in the continuum limit, the point where the spectrum of the Z?ks operator intersects 
the real axis corresponds to four massless modes. By contrast, D 2 leads to one mode in each of 
the two intersections away from the origin, and two at the origin. Finally, for D4 one obtains two 
modes at each of the two intersections of the spectrum with the real axis. 

The taste chirality of the eigenmodes ^ of D4 and D 2 , is given by (^ , r 5 5r5\E'), where r 55 = 75(8175 
exactly, and = 75(811+0(0) in spin (g> taste. The taste chirality of the eigenmodes corresponding 
to eigenvalues Ai is c, while that of the A 2 -eigenvectors is — c. This is also depicted in Fig. [2] one 
can see that the taste chirality of the real eigenmodes is ±1, and is the same (+1 or —1) for 
all modes in a given branch of the D4 or Z) 2 spectrum. The implications of a well-defined taste 
chirality have been stressed in j2j: if TssTs sa ±1, then ^5 w iTg, so that the spin chirality of 
the real eigenmodes can be probed by T^. This is the reason why the index theorem applies to 
D 2 and D4, while it does not for Dks (where both the ±1 taste chiralities lay on the same single 
branch.) 

A shift of the spectra by a real value can thus lead to chiral low-momentum zero modes in each 
branch, and hence to the possibility of constructing an appropriate index. A common way to study 
the index consists of looking at the flow of eigenvalues A(m) of: 

H(m) =75(13 + m). (20) 

In general, if (D + m) has a zero-mode \^&o) for m = mo, then, correspondingly, H has a vanishing 
eigenvalue A(mo) = 0. With a small perturbation of m away from mo, i.e. m = too + <5to, at leading 
order the eigenvalues get displaced by an amount ( x Po|75( m — "icOl^o): namely one finds crossings 
A(to) = ±(to - to ), if |*o) is a chiral mode: (* |75|*o) = ±1- The saturation of (§r 55 r5*) at 



value ±1 discussed above allows us to trade T 5 for r 55 and use the latter in eq.(20). 

An alternative way to look at the spectral flow was proposed in ref. [l] for the D4 operator, by 
studying the eigenvalues oQ 

H( P ) = r 55 D KS - ^r 5 . (21) 

a 



2 Actually, R ef. |l| proposed to consider the spectral flow of (iDks — £^5)- As recognized in |2j, that operate 
the same as eq. to a redefinition of the r? M phase factors. 
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Figure 2: (Left) Taste chirality properties of the D2 and D4 eigenvectors, as a function of (the 
component of minimum modulus of) their momentum, in the free limit. On an infinite lattice, 
the eigenvectors associated with real eigenvalues have vanishing momentum and a well-defined 
taste chirality ±1. For eigenmodes corresponding to eigenvalue Ai, the taste chirality becomes 
+1, while for A2-eigenmodes the taste chirality approaches —1. (Right) The taste chiralities of the 
eigenmodes of the D2 operator; the size of the points corresponds to the magnitude of c, while the 
color indicates the sign: blue for +c, red for — c. 



Fig. 3 displays a comparison of the two different ways to define the spectral flow for the D4 operator: 
the plots in the top row show the flow of eigenvalues of If as a function of p (eq.(21|), whereas 
those in the bottom row refer to the "standard" definition of the flow, using eq. ( 20 ) . In each row, 
the left panel displays the results from a cold (i.e., free) configuration on a lattice of size 16 3 x 32, 
while the central panel is obtained from a cooled configuration of topological charge Q = 1 on a 
lattice of size 8 4 , and finally the right panel displays the results from a "rough" (i.e., non-cooled) 
quenched Q = — 1 configuration at (3 = 6/g 2 — 6, on a lattice of size f2 4 . In the latter case, the 
comparison of the two flow definitions shows that, with the standard definition, the region around 
the real axis is populated by a large number of eigenvalues, preventing one from identifying the 
crossing with accuracy. 

Next, it is interesting to compare the identification of the index, using the spectral flow defined 
from eq. (20), for staggered fermions with a taste-dependent mass term, and for conventional 
Wilson fermions. This is shown in Fig. [4] the left, central and right plot in each row show the 
spectral flow for D4, D2 and a standard Wilson operator, respectively, while the three different 
rows, from top to bottom, refer to a cold configuration, to a cooled Q = 1 configuration, and 
to a non- cooled Q = — 1 quenched configuration at (3 = 6. It is interesting to observe that, as 
expected, the spectral flow on a cooled instanton configuration clearly reveals Nf x Q crossings. 
However, one already sees that in the plots of the (3 — 6 configuration the gap tends to close. This 
is especially the case for the D4 operator, and is related to the properties that will be discussed in 
Section [U 

The overall message that can already be drawn from these observations (before addressing a full- 
fledged numerical investigation) is that the gauge field fluctuations in interacting configurations 
reduce the width of the gap in the spectrum, and blur the distinction between light modes and 
doublers. 
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Figure 3: Comparison of the spectral flow for the D4 operator as obtained from the eigenvalues of 
the operators defined in eq. (21 1 (top row panels) and eq. (20) (bottom panels). The three plots 
(from left to right) in each row show, respectively, the eigenvalues of H (or H) as a function of 
p/a (or m) from a free configuration on a lattice of size 16 3 x 32, from a cooled configuration of 
topological charge Q — 1 on a lattice of size 8 4 , and from a non-cooled Q = — 1 gauge configuration 
at f3 = 6, on a lattice of size 12 4 . 



3 Numerical investigation on interacting configurations 

As we showed in the previous section, the fluctuations in typical interacting configurations lead to a 
filling of the gap in the spectral flow for the various lattice Dirac operators that we are considering, 
making a proper identification of the index difficult. A related effect can also be seen directly in 
the spectra of the operators: the panels in Fig. [5] show a comparison of the spectra of D4 (top row) 
and D2 (bottom row) in the free case (left), and in interacting configurations at = 6 (central 
panels, in which different values of p/a or m are used) and at j3 = 5.8 (right). The figure shows 
evidence for the superior robustness of lattice fermions based on the D2 operator, over D4: at 
j3 = 5.8 for example, a gap remains clearly visible for D 2 , while it has all but disappeared for D4. 

This can be understood from the fact that, since D4 involves 4-link parallel transporters, it is 
more sensitive to the gauge field fluctuations in interacting configurations than D2 which involves 
2-link transporters only. 

However, for practical applications in large-scale simulations, it is important to remark that, as 
usual, the effect of gauge fluctuations can be considerably reduced through some suitably optimized 
smearing procedure. 

Next, we considered the effectiveness of these operators for spectroscopy calculations. To this 
end, we performed a simple test, by studying the mass mps of the lightest meson in the pseudoscalar 
channel (the pion). We computed the quark propagator G(x,y, z,t) from a point source, on 
quenched configurations at j3 = 6 on a lattice of size 16 3 x 32, then we evaluated the p = 
correlation function 

C(t) = £ G(x, y, z, t)T 55 G(x, y, z, i)tr 55 = ]T \G(x, y, z, t)\ 2 , (22) 

xyz xyz 

and extracted amps searching for the large-time plateau in the effective mass plot, as a function of 
t. Monitoring the behavior of {amps) 2 as a function of (am), one can study the partially conserved 
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Figure 4: Spectral flows for D4 (left), D 2 (center) and a standard Wilson operator (right), on a 
cold configuration (top), on a cooled Q — 1 configuration (middle), and on a non-cooled Q = — 1 
quenched configuration at /3 = 6 (bottom). The solid blue lines in the top row show analytic 
results. 
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Figure 5: Spectra of D4 (top) and D2 (bottom) on different types of configurations. As compared 
to the free case (left) , the gap in the spectrum of eigenvalues of D4 on interacting configurations 
tends to close more rapidly than in the case of £>2- The second and third plot in each row are 
obtained from quenched configurations at f3 — 6 (in the third plot on the top row, symbols of 
different colors correspond to different values of p/a). Finally, the plots on the right are obtained 
from a coarser lattice, at f3 = 5.8. 
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Figure 6: The exponential decay of the correlation function associated with the lightest meson in 
the pseudoscalar channel for Dq (left), for D 2 (center) and D4 (right panel) on a free configuration, 
for different values of the bare quark mass. 



axial current and the issues related to mass renormalization. 

Fig. [6] shows the correlators obtained on a free configuration, for Dq (left panel) , for D 2 (central 
panel) and for D4 (right panel). As expected, the D 2 operator leads to a massless pion for both 
am = and am = 2. 

For an interacting configuration (at (3 = 6), the comparison between D 2 and D4 shown in 
Fig. [7] reveals that for D 2 one obtains a massless pion at approximately am ~ 1.15, while for 
D4 the same happens for am ~ 0.25. Comparing these numbers with the values of the bare 
masses corresponding to a massless pseudoscalar state in the free limit (1 and 2 respectively), 
these results give an indication that the mass renormalization is more pronounced for D4 than for 
D 2 . Quantitatively, one can observe that the renormalization factor grows exponentially with the 
length of the parallel transporters used: (0.25/1) 1 / 4 ~ (1.15/2) 1 / 2 , in agreement with the fact that 
D4 involves 4-link terms, as opposed to D 2 , in which the mass term is constructed from 2- link 
terms. 

Remarkably, with the D 2 operator, the pion mass shows a square-root behaviour of three different 
kinds: one can approach the critical bare quark mass to amo ~ 1-15 from the left or from the right, 
i.e. from the inside or the outside of the D 2 spectrum. But one can also approach the other critical 
quark mass am = 0, corresponding to the central branch of the spectrum, which remains zero as 
in the free case by symmetry of the average spectrum. The transition from one branch to another 
seems rather abrupt, and the scaling of the pion mass can be observed over a broad range of quark 
masses approaching zero. 

The lesson is that D 2 may provide a cost-effective way to simulate Nf = 2 light quark species, 
without fine-tuning of the bare quark mass to approach the chiral limit. 



4 Conclusions 

In this work, we performed a numerical study of staggered Dirac operators with a taste-dependent 
mass term. We restricted our attention to operators including mass terms with tensor or pseu- 
doscalar structure in taste space: their 75-hermiticity properties are such, that their eigenvalues 
come in complex conjugate pairs (as is the case for the usual staggered Dirac operator), leading to 
a real fermionic determinant, which is non-negative in the absence of negative real eigenvalues. 

Such operators were proposed by Adams [TJ2] and by Hoelbling [3] . We compared their properties 
both in the free limit and on interacting configurations at typical values of the gauge coupling. 

Our results show that these operators can indeed be used to separate the low-lying modes and 
reduce the number of tastes, in a way characterized by well-defined topological properties. Our 
study of the spectral flow reveals that, for the 4-link operator (with a taste-pseudoscalar mass term), 
the gap in the eigenvalue spectrum tends to close rather early, obstructing an easy identification 
of the eigenvalue crossings, which are related to the index. As one might have expected, the 2-link 
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Figure 7: Tuning of the bare quark mass to obtain a light pion: the two plots show the pion mass 
as a function of the bare quark mass, for D2 (left panel) and D4 (right panel) at /3 = 6. 

operator shows markedly more robustness to gauge fluctuations. 

We also performed an elementary study of pion propagators, which shows that the lightest 
meson is rather easy to isolate without explicitly disentangling spin and taste degrees of freedom. 
Approaching the chiral limit requires in general the fine-tuning of an additive mass term, as for 
Wilson fermions. One important exception occurs for the 2-link operator: if one chooses the 
middle branch of the spectrum, one can study a theory with two tastes, where the additive mass 
renormalization vanishes due to the symmetry of the spectrum. Therefore, no fine-tuning is needed. 

Although the 2-link operator was designed to produce a single taste (with a fine-tuned additive 
mass), it may well be that its most promising use is to simulate two tastes without additive mass 
renormalization. 

Finally, we studied the properties of an overlap operator with a -D4 kernel (see Appendix). We 
found that its locality properties are similar to those of the operator based on a Wilson kernel. 
As it concerns the computational cost for a quark propagator calculation, we found that, in the 
free limit or on very smooth gauge configurations, the inversion of the operator based on a kernel 
with a four-link mass term is almost one order of magnitude faster than using an overlap with 
Wilson kernel. However, we also observed a significant loss of efficiency on interacting (quenched) 
configurations at j3 = 6, where the operator with the D4 kernel is only approximately twice as fast 
as that with a Wilson kernel. The reason for this can probably be traced back to the fact that the 
four-link transporters in the mass term are more sensitive to the effect of the fluctuations in gauge 
configurations on coarser lattices. Our crude assessment indicates that this new, staggered, overlap 
operator does not bring a major computational advantage over a Wilson kernel, while producing 
two degenerate flavors, but without the full SU(2) flavor symmetry. 

Two copies of an overlap operator with a kernel based on Hoelbling's 2-link operator would give 
more flexibility, e.g. that of simulating two flavors with unequal masses, for a similar computer 
effort. 
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A Staggered overlap operator 

We also studied the properties of an overlap operator based on a staggered D4 kernel, as originally 
proposed in [lj. The construction is completely standard: 

* I (A.1) 

and leads to two exactly massless physical fermions in the continuum limit, without fine-tuning. As 
compared to a conventional overlap operator based on a Wilson kernel, the potential advantages 
of this construction are related to the reduced kernel size (D4 is a matrix of size four times smaller 
than a Wilson operator on the same lattice). 

To understand the effectiveness of an overlap operator with a D4 kernel, the first important 
issue to be discussed is the locality of the operator. As is well-known, an overlap operator is not 



ultra-local 18 . Its locality properties can be studied by looking at the decay of its matrix elements 
between source and sink at sites x and y (which we denote as M x<y , where, for simplicity, we only 
show the indices corresponding to the site coordinates), as a function of the distance between x 



and y 19 . To this end, in the two plots at the top of Fig. 8 we show the decay of |M Xi3/ | against 
\x — y\\, the 1-norm distance (or "Manhattan distance") between the sites x and y, comparing the 
matrix elements of an overlap operator obtained using a D4 kernel (left panel) or a conventional 
Wilson kernel (right panel) . Although the D4 kernel is less local than the Wilson kernel, the locality 
properties of the corresponding overlap operators are comparable. This appears quite clearly in 
the plots displayed at the bottom of the figure, in which the results for the two operators are shown 
together, for a cold configuration (left panel) and for a configuration at f3 = 6 (right panel). 

Another important factor in the efficiency of a lattice Dirac operator is the cost of applying the 
operator to a vector. The multiplication by the kernel is about twice as fast, if one uses D4 instead 
of a Wilson kernel (staggered fields do not have an explicit spinor index but D4 has twice as many 
non-zero elements as the Wilson operator). In the computation of the sign of H = r 55 I?4, using 
the conjugate gradient (CG) method, and no deflation, the gain with respect to a conventional 
Wilson kernel is a factor ranging from approximately 2-3 to about 8. However, these numbers are 
only gross estimates, and could be improved, e.g., by optimizing the parameters of the D4 kernel. 
Similarly, an improved form for the kinetic operator, link smearing (for the kinetic and/or the mass 
term), and standard tricks related to deflation, preconditioning, etc... could be applied. 

To discuss the computational cost of the inversion of the operator, we compared the two overlap 
operators on the same pure-glue, /3 = 6, background on a 12 4 lattice, using the same, basic, 
inner/outer CG algorithm. In our computation, we evaluated the propagator as the solution of the 
equation: 

{D ov + m)\D ov + m)x = (D ov + m) ] b (A.2) 

using a conjugate gradient (CG) iterative solver: at each iteration, sign(iZ) is applied to a vector 
v through a 2-pass Lanczos process. One builds a tridiagonal matrix T and takes the signs of its 
eigenvalues (which are representative of those of H), then reconstructs sign(_ff)u, as described in 



ref. 21 . The results are displayed at the top of Fig. 8: the three plots (from left to right) show 
the computational cost for the outer CG iteration, for the matrix-times-vector multiplication, and 
the total CPU cost. For comparison, we also show the analogous results in the free-field case, in 
the plots at the bottom of the figure. This comparison shows that the computational advantages 
expected from elementary arguments, and observed in the free limit, turn out to be dramatically 
reduced in "realistic" interacting configurations. Again, this reduction points to a reduction in the 
eigenvalue gap of D4. 
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Figure 8: The plots at the top of the figure show the decay of the matrix elements of an overlap 
operator obtained using a D4 kernel (left) or a Wilson kernel (right), as a function of the 1- norm 
distance between the sites. The results from the two operators are displayed together in the two 
bottom plots, for a cold (bottom left panel) and a f) = 6 configuration (bottom right panel). 
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Figure 9: Top row: Comparison of the computational costs for the overlap operator inversion at 
a given level of precision, using a D4 (red points) or a Wilson (green symbols) kernel: the left 
panel displays the costs related to the outer CG iteration, the central plot shows the costs of the 
matrix-times- vector multiplication, and finally the right panel displays the total CPU cost. For 
comparison, the plots in the bottom row show the analogous results for a free configuration. 
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